Arabic Questions Classification Using Modified TF-IDF

نویسندگان

چکیده

Classifying the cognitive levels of assessment questions according to Bloom’s taxonomy can help instructors design effective assessments that are well aligned with intended learning outcomes. However, classification process is time consuming and requires experience. Many studies have attempted automate by utilizing different machine text mining approaches, but none has examined Arabic questions. The purpose this study examine research gap introduce a new feature extraction method would better suit their unique characteristics. It also aims provide Arab tool them automatically classify To accomplish purpose, developed dataset more than 600 then proposed modified term frequency-inverse document frequency (TF-IDF) for extracting features from Unlike traditional TF-IDF, was designed take nature into consideration. evaluated comparing it two methods been used classifying English questions, i.e., TF-IDF called part-of-speech-inverse (TFPOS-IDF). A t-test utilized whether difference in performance between three statistically significant. outperformed other methods. overall accuracy, precision, recall scored were significantly higher those TFPOS-IDF evaluation results indicate promising potential method, which be extended languages.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The DF-ICF Algorithm- Modified TF-IDF

The tf-idf is an algorithm which is generally used where massive data processing is done. Tf-idf is the weight given to a particular term within a document and it is proportional to the importance of the term. This paper aims to use the idea behind the tf-idf algorithm to design the df-icf algorithm which finds the importance of a particular document within the given corpus. General Terms DF-IC...

متن کامل

Automatic Mood Classification Using TF*IDF Based on Lyrics

This paper presents the outcomes of research into using lingual parts of music in an automatic mood classification system. Using a collection of lyrics and corresponding user-tagged moods, we build classifiers that classify lyrics of songs into moods. By comparing the performance of different mood frameworks (or dimensions), we examine to what extent the linguistic part of music reveals adequat...

متن کامل

Clustering scRNA-Seq Data using TF-IDF

In this abstract, we propose several computational approaches for clustering scRNA-Seq data based on the Term Frequency Inverse Document Frequency (TF-IDF) transformation that has been successfully used in the field of text analysis. Empirical evaluation on simulated cell mixtures with different levels of complexity suggests that the TF-IDF methods consistently outperform existing scRNA-Seq clu...

متن کامل

Semantic Search Engine using Joomla Framework with Modified tf-idf and TRApriori Algorithm

As the amount of data available in a repository increases, content retrieval from the huge data stored in the repository becomes a tedious task. Though Content Management System helps us to manage the data, yet searching the relevant data is still a daunting task. For that, we need efficient Search Algorithms for maximizing the correlation between data required and data returned by semantic sea...

متن کامل

On a robust document classification approach using TF-IDF scheme with learned, context-sensitive semantics

Document classification is a well-known task in information retrieval domain and relies upon various indexing schemes to map documents into a form that can be consumed by a classification system. Term Frequency-Inverse Document Frequency (TF-IDF) is one such class of term-weighing functions used extensively for document representation. One of the major drawbacks of this scheme is that it ignore...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3094115